Pitch resynchronization while recovering from a late frame in a predictive speech decoder
نویسندگان
چکیده
The concealment procedure used by CELP speech decoders to regenerate lost frames introduces an error that propagates into the following frames. Within the context of voice transmission over packet networks, some packets arrive too late to be decoded and must also be concealed. Once they arrive however, those packets can be used to update the internal state of the decoder, which stops error propagation. Yet, care must be taken to ensure a smooth transition between the concealed frame and the following “updated” frame computed with properly updated internal states. During voiced or quasi-periodic segments, the pitch phase error that is generally introduced by the concealment procedure makes it difficult and detrimental to quality to use the traditional fade-in, fade-out approach. This paper presents a method to handle that pitch phase error. Specifically, the transition is done in such a way that the natural pitch periodicity of the speech signal is not broken.
منابع مشابه
Pitch Resynchronization Wh a Late Frame in a Predicti
The concealment procedure used by CELP speech decoders to regenerate lost frames introduces an error that propagates into the following frames. Within the context of voice transmission over packet networks, some packets arrive too late to be decoded and must also be concealed. Once they arrive however, those packets can be used to update the internal state of the decoder, which stops error prop...
متن کاملA very low bit rate speech coder using HMM-based speech recognition/synthesis techniques
This paper presents a very low bit rate speech coder based on HMM (Hidden Markov Model). The encoder carries out phoneme recognition, and transmits phoneme indexes, state durations and pitch information to the decoder. In the decoder, phoneme HMMs are concatenated according to the phoneme indexes, and a sequence of mel-cepstral coefficient vectors is generated from the concatenated HMM by using...
متن کاملPitch-synchronous Speech Coding Based on Timbre Vectors
A pitch-synchronous method and system for speech coding using timbre vectors is disclosed. On the encoder side, speech signal is segmented into pitch-synchronous frames without overlap, then converted into a pitch-synchronous amplitude spectrum using FFT. Using Laguerre functions, the amplitude spectrum is transformed into a timbre vector. Using vector quantization, each timbre vector is conver...
متن کاملA New Lpc Error Criterion for Improved Pitch Tracking
A NEW LPC ERROR CRITERION FOR IMPROVED PITCH TRACKING Mohammad R. Zad-Issa and Peter Kabal Electrical Engineering, McGill University, Montreal, Quebec, Canada H3A 2A7 ABSTRACT In Linear Predictive coders the output of the LP analysis lter is used to represent the glottal excitation signal. For high pitched voices during nasal sounds or nasalized vowels, the speech signal takes on a sinusoidal s...
متن کاملKalman tracking of linear predictor and harmonic noise models for noisy speech enhancement
This paper presents a speech enhancement method based on the tracking and denoising of the formants of a linear prediction (LP) model of the spectral envelope of speech and the parameters of a harmonic noise model (HNM) of its excitation. The main advantages of tracking and denoising the prominent energy contours of speech are the efficient use of the spectral and temporal structures of success...
متن کامل